Positive Unlabeled Leaning for Time Series Classification
نویسندگان
چکیده
In many real-world applications of the time series classification problem, not only could the negative training instances be missing, the number of positive instances available for learning may also be rather limited. This has motivated the development of new classification algorithms that can learn from a small set P of labeled seed positive instances augmented with a set U of unlabeled instances (i.e. PU learning algorithms). However, existing PU learning algorithms for time series classification have less than satisfactory performance as they are unable to identify the class boundary between positive and negative instances accurately. In this paper, we propose a novel PU learning algorithm LCLC (Learning from Common Local Clusters) for time series classification. LCLC is designed to effectively identify the ground truths’ positive and negative boundaries, resulting in more accurate classifiers than those constructed using existing methods. We have applied LCLC to classify time series data from different application domains; the experimental results demonstrate that LCLC outperforms existing methods significantly.
منابع مشابه
Ensemble Based Positive Unlabeled Learning for Time Series Classification
Many real-world applications in time series classification fall into the class of positive and unlabeled (PU) learning. Furthermore, in many of these applications, not only are the negative examples absent, the positive examples available for learning can also be rather limited. As such, several PU learning algorithms for time series classification have recently been developed to learn from a s...
متن کاملIterative Extreme Learning Machine for Single Class Classifier using General Mapping Convergence framework
Single Class Classification (SCC) is the problem to distinguish one class of data (called positive class) from the rest data of multiple classes (negative class). SCC problems are common in real world where positive and unlabeled data are available but negative data is expensive or very hard to acquire. In this paper, extreme leaning machine (ELM), a recently developed machine learning algorith...
متن کاملLearning to Rank Biomedical Documents with only Positive and Unlabeled Examples: A Case Study
In the text mining field, obtaining training data requires human experts' labeling efforts, which is often time consuming and expensive. Supervised learning with only a small number of positive examples and a large amount of unlabeled data, which is easy to get, has attracted booming interests in the field. A recently proposed relabeling method, which assumes unlabeled data as negative data for...
متن کاملSemi-Supervised Sequence Classification with HMMs
Using unlabeled data to help supervised learning has become an increasingly attractive methodology and proven to be effective in many applications. This paper applies semi-supervised classification algorithms, based on hidden Markov models (HMMs), to classify sequences. For model-based classification, semisupervised learning amounts to using both labeled and unlabeled data to train model parame...
متن کاملSemi-supervised Learning from Only Positive and Unlabeled Data Using Entropy
The problem of classification from positive and unlabeled examples attracts much attention currently. However, when the number of unlabeled negative examples is very small, the effectiveness of former work has been decreased. This paper propose an effective approach to address this problem, and we firstly use entropy to selects the likely positive and negative examples to build a complete train...
متن کامل